The Last-Step Minimax Algorithm
نویسندگان
چکیده
We consider on-line density estimation with a parameterized density from an exponential family. In each trial t the learner predicts a parameter t. Then it receives an instance xt chosen by the adversary and incurs loss ln p(xtj t) which is the negative log-likelihood of xt w.r.t. the predicted density of the learner. The performance of the learner is measured by the regret de ned as the total loss of the learner minus the total loss of the best parameter chosen o -line. We develop an algorithm called the Last-step Minimax Algorithm that predicts with the minimax optimal parameter assuming that the current trial is the last one. For one-dimensional exponential families, we give an explicit form of the prediction of the Last-step Minimax Algorithm and show that its regret is O(lnT ), where T is the number of trials. In particular, for Bernoulli density estimation the Last-step Minimax Algorithm is slightly better than the standard Krichevsky-Tro mov probability estimator.
منابع مشابه
A Strategy to Enforce the Discrete Minimax Principle on Finite Element Meshes
Low quality meshes in threedimensional finite element diffusion simulations often violate the discrete Minimax principle. An indication of this shortcoming is the occurrence of negative concentrations. We present an a posteriori refinement algorithm to enforce the discrete Minimax principle by locally refining the mesh based on the last time-step. Two examples of diffusion simulations where neg...
متن کاملConvergence Results of a Local Minimax Method for Finding Multiple Critical Points
In [14], a new local minimax method that characterizes a saddle point as a solution to a local minimax problem is established. Based on the local characterization, a numerical minimax algorithm is designed for finding multiple saddle points. Numerical computations of many examples in semilinear elliptic PDE have been successfully carried out to solve for multiple solutions. One of the important...
متن کاملTruncated Linear Minimax Estimator of a Power of the Scale Parameter in a Lower- Bounded Parameter Space
Minimax estimation problems with restricted parameter space reached increasing interest within the last two decades Some authors derived minimax and admissible estimators of bounded parameters under squared error loss and scale invariant squared error loss In some truncated estimation problems the most natural estimator to be considered is the truncated version of a classic...
متن کاملRandom Permutation Online Isotonic Regression
FORWARD ALGORITHMS Two observations: • PAVA is efficient and generalizes to partial orders • Follow The Leader algorithms are common in practice Forward Algorithm: To predict at xt, imagine y′ t ∈ [0, 1], compute f∗ on {(x1, y1) . . . (xt−1, yt−1)} ∪ {(xt, y′ t)}, and play ŷt = f(xt). FORWARD ALGORITHM EXAMPLES • IR-Int: Compute f∗ on past data. Predict with average of f∗ at nearest xi. • Inter...
متن کاملAchievability of Asymptotic Minimax Regret in Online and Batch Prediction
The normalized maximum likelihood model achieves the minimax coding (log-loss) regret for data of fixed sample size n. However, it is a batch strategy, i.e., it requires that n be known in advance. Furthermore, it is computationally infeasible for most statistical models, and several computationally feasible alternative strategies have been devised. We characterize the achievability of asymptot...
متن کامل